Winner-Take-All EM Clustering
نویسندگان
چکیده
The EM algorithm is often used with mixture models to cluster data, but for efficiency reasons it is sometimes desirable to produce hard clusters. Several hard clustering limits of EM are known. For example, k-means clustering can be derived from EM in a Gaussian mixture model by taking the limit of all variances going to zero. We present a new method of deriving Winner-Take-All versions of EM that can be used for mixtures, such as heteroscedastic Gaussians, where it is not possible to take that limit. The resulting clusters can have non-convex boundaries, allowing for some of the clusters to reside “inside” others, producing dense foreground clusters embedded in a more diffuse background. Experiments show that using unequal variances can give better clusters on real data sets in terms of external quality measures.
منابع مشابه
An Experimental Comparison of Several Clustering and Initialization Methods
We examine methods for clustering in high dimensions. In the first part of the paper, we perform an experimental comparison between three batch clustering algorithms: the Expectation–Maximization (EM) algorithm, a “winner take all” version of the EM algorithm reminiscent of the K-means algorithm, and model-based hierarchical agglomerative clustering. We learn naive-Bayes models with a hidden ro...
متن کاملSAR imagery segmentation using probabilistic winner-take-all clustering
This paper applies a recently-developed neural clustering scheme, called "probabilistic winner-take-all (PWTA)", to image segmentation. Experimental results are presented. These results show that the PWTA clustering scheme signiicantly outperforms the popular k-means algorithm when both are utilized to segment a synthetic-aperture-radar (SAR) image representing ship targets in an open-ocean scene.
متن کاملClassifying with Gaussian Mixtures and Clusters
In this paper, we derive classifiers which are winner-take-all (WTA) approximations to a Bayes classifier with Gaussian mixtures for class conditional densities. The derived classifiers include clustering based algorithms like LVQ and k-Means. We propose a constrained rank Gaussian mixtures model and derive a WTA algorithm for it. Our experiments with two speech classification tasks indicate th...
متن کاملSAR image processing using probabilistic winner-take-all learning and artificial neural networks
This paper develops a two-stage approach for the identii-cation of ship targets in airborne synthetic aperture radar (SAR) imagery representing open ocean scenes. The rst stage of the developed approach segments the SAR image using a novel neural clustering scheme, called "probabilis-tic winner-take-all (PWTA)." As for the second stage, it employs a backpropagation (BP) neural network to classi...
متن کاملComparing Clustering Algorithms for the Identification of Similar Pages in Web Applications
In this paper, we analyze some widely employed clustering algorithms to identify duplicated or cloned pages in web applications. Indeed, we consider an agglomerative hierarchical clustering algorithm, a divisive clustering algorithm, k-means partitional clustering algorithm, and a partitional competitive clustering algorithm, namely Winner Takes All (WTA). All the clustering algorithms take as ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009